Abstract

We propose a tuner, suitable for adaptive control 
and (in its discrete-time version) adaptive filtering ap- 
plications, that sets the second derivative of the param- 
eter estimates rather than the first derivative as is done 
in the overwhelming majority of the literature. Com- 
parative stability and performance analyses are pre- 
sented. 
1 Parameter Adjustment

In adaptive control and recursive parameter estimation one often needs to adjust recursively an estimate $\hat{p}$ of a vector $p$, which comprises $n$ constant but unknown parameters, using measurements of a quantity

$$y = x^T p + w. \qquad (1)$$

Here $x : [0, t) \to \mathbb{R}^n$ is a vector of known data, often called the regressor, and $w$ is a measurement error signal. The goal of tuning is to keep both the estimation error $x^T \hat{p} - y$ and the parameter error $\hat{p} - p$ as small as possible.

There are several popular methods for dealing with the problem above, for instance least-squares. Perhaps the most straightforward involves minimizing the prediction error via gradient-type algorithms of the form

$$\dot{\hat{p}} = -M x (x^T \hat{p} - y), \qquad (2)$$

where $M$ is a constant, symmetric, positive-definite gain matrix. Let us define $q = \hat{p} - p$ and analyze equations (1) and (2), which under the assumption that $w(t)$ is identically zero read:



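$$\dot{q} = -M x x^T q. \qquad (3)$$

The nonnegative function $V = \tfrac{1}{2} q^T M^{-1} q$ has time derivative $\dot{V} = q^T M^{-1} \dot{q} = -q^T x x^T q$, hence

$$\int_0^t \dot{V} \, d\tau = V(t) - V(0) = -\int_0^t (x^T q)^2 \, d\tau \le 0.$$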

Inspection of the equation above reveals that $V$ is bounded in time, thus $q \in \mathcal{L}^\infty$, and also that the error $x^T q \in \mathcal{L}^2$ (norms are taken on the interval $[0, t)$ where all signals are defined). These are the main properties an algorithm needs in order to be considered a suitable candidate for the role of a tuner in an adaptive control system. Often $\dot{q} \in \mathcal{L}^2$ or something similar is also a desirable property. To obtain the latter, normalized algorithms can be used; however, the relative merits of normalized versus unnormalized tuners are still somewhat controversial. Another alternative is to use a time-varying $M$, as is done in least-squares tuning.
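The properties of the gradient tuner can be checked numerically. The sketch below is only an illustration under stated assumptions: it integrates $\dot{q} = -M x x^T q$ with a forward-Euler step, takes $M = I$, and uses a hypothetical two-dimensional sinusoidal regressor and initial error, none of which come from the text. It records $V = \tfrac{1}{2} q^T M^{-1} q$ along the trajectory so that its monotone decrease can be inspected.

```python
import math

def sinusoidal_regressor(t):
    """Hypothetical persistently exciting regressor x(t) in R^2 (an assumption)."""
    return [math.sin(t), math.cos(t)]

def simulate_gradient_tuner(q0, regressor, gain=1.0, dt=1e-3, t_end=20.0):
    """Forward-Euler integration of q' = -gain * x * (x^T q), i.e. M = gain * I.

    Returns the samples of V = q^T q / (2 * gain) and the final parameter error q.
    """
    q = list(q0)
    v_samples = []
    for k in range(int(round(t_end / dt)) + 1):
        v_samples.append(sum(qi * qi for qi in q) / (2.0 * gain))
        x = regressor(k * dt)
        prediction_error = sum(xi * qi for xi, qi in zip(x, q))  # x^T q, noise-free case
        q = [qi - dt * gain * xi * prediction_error for qi, xi in zip(q, x)]
    return v_samples, q

# Illustrative initial parameter error (an assumption, not taken from the paper).
v_samples, q_final = simulate_gradient_tuner([1.0, -2.0], sinusoidal_regressor)
print("V(0) =", v_samples[0], "V(end) =", v_samples[-1])
```

With this regressor the parameter error also converges, which is consistent with persistency of excitation; for a regressor that is not persistently exciting only the boundedness of $V$ should be expected.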

In §2 we present a tuner that sets the second derivative of $\hat{p}$, and in §3 the effects of a white noise $w$ on the performance of the two algorithms are compared. Then we show some simulations and make concluding remarks.



2 The Accelerating Algorithm 

Classical tuners are such that the velocity of adaptation (the first derivative of the parameters) is set proportional to the regressor and to the prediction error $x^T \hat{p} - y = x^T q$. We propose to set the acceleration of the parameters:

$$\ddot{\hat{p}} = \ddot{q} = -2\dot{q} - x x^T (q + 2\dot{q}). \qquad (4)$$

Notice that the formula above is implementable (using $2n$ integrators) if measurement error is absent, because the unknown $q$ appears only in scalar product with $x$. Choose another function of Lyapunovian inspiration:

$$V = q^T q + q^T \dot{q} + \dot{q}^T \dot{q} > 0.$$



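Taking derivatives along the trajectories of (4) gives

$$\dot{V} = 2 q^T \dot{q} + \dot{q}^T \dot{q} - (q^T + 2\dot{q}^T)(2\dot{q} + x x^T q + 2 x x^T \dot{q}) = -3 \dot{q}^T \dot{q} - (q + 2\dot{q})^T x x^T (q + 2\dot{q}) \le 0.$$

Integrating $\dot{V}$ we obtain

$$V(t) = V(0) - 3 \int_0^t \|\dot{q}\|^2 \, d\tau - \int_0^t \left[ x^T (q + 2\dot{q}) \right]^2 d\tau,$$

which leads immediately to the desired properties:

$$q, \dot{q} \in \mathcal{L}^\infty; \quad \dot{q} \in \mathcal{L}^2; \quad x^T (q + 2\dot{q}) \in \mathcal{L}^2.$$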
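The monotone decrease of $V$ can be illustrated numerically. The following sketch assumes the accelerating tuner in the form $\ddot{q} = -2\dot{q} - x x^T (q + 2\dot{q})$, which is the form implied by the derivative computation above, and integrates it with a classical fourth-order Runge-Kutta step. The regressor, step size, horizon, and initial conditions are hypothetical choices made only for the illustration.

```python
import math

def sinusoidal_regressor(t):
    """Hypothetical two-dimensional regressor (an assumption)."""
    return [math.sin(t), math.cos(t)]

def accel_rhs(t, q, qdot, regressor):
    """First-order form: q' = qdot, qdot' = -2 qdot - x x^T (q + 2 qdot)."""
    x = regressor(t)
    modified_error = sum(xi * (qi + 2.0 * vi) for xi, qi, vi in zip(x, q, qdot))
    return list(qdot), [-2.0 * vi - xi * modified_error for xi, vi in zip(x, qdot)]

def shifted(base, slope, h):
    """Return base + h * slope, elementwise."""
    return [b + h * s for b, s in zip(base, slope)]

def rk4_step(t, q, qdot, dt, regressor):
    k1q, k1v = accel_rhs(t, q, qdot, regressor)
    k2q, k2v = accel_rhs(t + dt / 2, shifted(q, k1q, dt / 2), shifted(qdot, k1v, dt / 2), regressor)
    k3q, k3v = accel_rhs(t + dt / 2, shifted(q, k2q, dt / 2), shifted(qdot, k2v, dt / 2), regressor)
    k4q, k4v = accel_rhs(t + dt, shifted(q, k3q, dt), shifted(qdot, k3v, dt), regressor)
    q_next = [qi + dt / 6 * (a + 2 * b + 2 * c + d)
              for qi, a, b, c, d in zip(q, k1q, k2q, k3q, k4q)]
    qdot_next = [vi + dt / 6 * (a + 2 * b + 2 * c + d)
                 for vi, a, b, c, d in zip(qdot, k1v, k2v, k3v, k4v)]
    return q_next, qdot_next

def lyapunov(q, qdot):
    """V = q^T q + q^T qdot + qdot^T qdot."""
    return sum(qi * qi + qi * vi + vi * vi for qi, vi in zip(q, qdot))

def simulate_accelerating_tuner(q0, regressor, dt=1e-3, t_end=10.0):
    q, qdot = list(q0), [0.0] * len(q0)  # qdot(0) = 0 is an illustrative choice
    samples = [lyapunov(q, qdot)]
    for k in range(int(round(t_end / dt))):
        q, qdot = rk4_step(k * dt, q, qdot, dt, regressor)
        samples.append(lyapunov(q, qdot))
    return samples, q, qdot

lyap_samples, q_end, qdot_end = simulate_accelerating_tuner([1.0, -2.0], sinusoidal_regressor)
print("V(0) =", lyap_samples[0], "V(end) =", lyap_samples[-1])
```

Note that the simulation works directly with $q$, which is possible here only because the true parameters are known to the simulator; an implementation would integrate $\hat{p}$ and use the measured error $x^T \hat{p} - y$.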

The slow variation property $\dot{q} \in \mathcal{L}^2$ follows without the need for normalization, and now we obtain $x^T (q + 2\dot{q}) \in \mathcal{L}^2$ instead of $x^T q \in \mathcal{L}^2$ as before. We might regard $x^T (q + 2\dot{q})$ as a modified error, which can be used in the stability analysis of a detectable or "tunable" adaptive system via an output-injection argument. A generalization of (4) is



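$$\ddot{q} = -M_1 \left[ x (x^T \hat{p} - y) + 2 (M_2 + x x^T M_1 M_3) \dot{q} \right], \qquad (5)$$

with $M_1$, $M_2$ and $M_3$ constant, symmetric, positive-definite $n \times n$ matrices such that $M_2^{-1} < 4 M_1 M_3 M_1$ and $M_2 M_1 M_3 + M_3 M_1 M_2 > M_1^{-1}/2$. The properties of tuner (5), which can be obtained using the positive-definite function

$$V = \begin{bmatrix} q \\ \dot{q} \end{bmatrix}^T \begin{bmatrix} M_2 & M_1^{-1}/2 \\ M_1^{-1}/2 & M_3 \end{bmatrix} \begin{bmatrix} q \\ \dot{q} \end{bmatrix}$$

in the same manner as before, are

$$q, \dot{q} \in \mathcal{L}^\infty; \quad \dot{q} \in \mathcal{L}^2; \quad x^T (q + 2 M_1 M_3 \dot{q}) \in \mathcal{L}^2.$$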

3 Covariance Analysis 

We now consider the effects on the expected value and covariance of $q(t)$ of the presence of a measurement error. The assumptions are that $w(t)$ is a white noise with zero average and covariance $\sigma_w^2 \delta(t - \tau)$ and that $x(t)$ are given, deterministic data. For comparison purposes, first consider what happens when the conventional tuner (2) is applied to (1) in the presence of measurement error $w$:

$$\dot{q} = -M x x^T q + M x w.$$



The solution to the equation above can be written in terms of the state transition matrix $\phi(t, \tau)$ of $-M x x^T$ as follows:

$$q(t) = \phi(t, 0) q(0) + \int_0^t d\tau \, \phi(t, \tau) M x(\tau) w(\tau),$$

hence $E_w\{q(t)\} = \phi(t, 0) q(0)$ because $E\{w(t)\} = 0$ by assumption. Here the notation $E_w\{\cdot\}$, denoting the expectation with respect to the random variable $w$, is used to emphasize that the stochastic properties of $x$ are not under consideration. The conclusion is that $q(t)$ will converge to zero in average as fast as $\phi(t, 0)$ does. The well-known persistency of excitation conditions on $x(\cdot)$ are sufficient for the latter to happen.

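To study the second moment of the parameter error, write

$$\begin{aligned} q q^T(t) = {} & \phi(t,0) \, q q^T(0) \, \phi^T(t,0) \\ & + \phi(t,0) q(0) \int_0^t d\tau \, w(\tau) x^T(\tau) M \phi^T(t,\tau) \\ & + \left( \int_0^t d\tau \, \phi(t,\tau) M x(\tau) w(\tau) \right) q^T(0) \phi^T(t,0) \\ & + \left( \int_0^t d\tau \, \phi(t,\tau) M x(\tau) w(\tau) \right) \left( \int_0^t d\tau \, w(\tau) x^T(\tau) M \phi^T(t,\tau) \right). \end{aligned}$$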







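The covariance of $q(t)$ can be written as the sum of four terms. The first is deterministic. The second term

$$E_w\left\{ \phi(t,0) q(0) \int_0^t d\tau \, w(\tau) x^T(\tau) M \phi^T(t,\tau) \right\} = \phi(t,0) q(0) \int_0^t d\tau \, E\{w(\tau)\} x^T(\tau) M \phi^T(t,\tau) = 0$$

because $w(t)$ has zero mean, and the third term is likewise zero. The fourth term

$$\begin{aligned} E_w & \left\{ \int_0^t d\tau_1 \, \phi(t,\tau_1) M x(\tau_1) w(\tau_1) \int_0^t d\tau_2 \, w(\tau_2) x^T(\tau_2) M \phi^T(t,\tau_2) \right\} \\ & = \int_0^t d\tau_1 \int_0^t d\tau_2 \, \phi(t,\tau_1) M x(\tau_1) E\{w(\tau_1) w(\tau_2)\} x^T(\tau_2) M \phi^T(t,\tau_2) \\ & = \sigma_w^2 \int_0^t d\tau_1 \, \phi(t,\tau_1) M x(\tau_1) x^T(\tau_1) M \phi^T(t,\tau_1), \end{aligned}$$

where Fubini's Theorem and the fact $E\{w(\tau_1) w(\tau_2)\} = \sigma_w^2 \delta(\tau_1 - \tau_2)$ were used. Performing the integration and adding the first and fourth terms results in

$$E_w\{q q^T(t)\} = \phi(t,0) \, q q^T(0) \, \phi^T(t,0) + \frac{\sigma_w^2}{2} \left( M - \phi(t,0) M \phi^T(t,0) \right).$$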



This equation can be given the following interpretation: for small $t$, when $\phi(t, 0)$ is close to the identity, the covariance of $q(t)$ remains close to $q q^T(0)$, the outer product of the error in the initial guess of the parameters with itself. As $\phi(t, 0) \to 0$, which will happen if $x(\cdot)$ is persistently exciting, $E_w\{q q^T(t)\}$ tends to $\sigma_w^2 M / 2$. This points to a compromise between higher convergence speeds and lower steady-state parameter error, which require respectively larger and smaller values of the gain $M$. Algorithms that try for the best of both worlds (parameter convergence in the mean-square sense) often utilize time-varying, decreasing gains; an example is the least-squares algorithm.
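The closed-form second moment of the velocity tuner can be cross-checked in the scalar case $n = 1$, where $M = m$ and $\phi(t, 0)$ is a scalar. The sketch below is an illustration under assumptions: the regressor $x(t) = \cos t$, the gain, the noise intensity, and the initial error are all hypothetical values. It integrates $\dot{\phi} = -m x^2 \phi$ together with the standard second-moment equation for a linear system driven by white noise, $\dot{P} = -2 m x^2 P + \sigma_w^2 m^2 x^2$ with $P(0) = q(0)^2$, and compares $P(t)$ with $\phi^2 q(0)^2 + \tfrac{1}{2} \sigma_w^2 (m - \phi m \phi)$.

```python
import math

def regressor(t):
    """Hypothetical scalar regressor x(t) (an assumption)."""
    return math.cos(t)

def rhs(t, phi, second_moment, gain, sigma2):
    """phi' = -gain x^2 phi and P' = -2 gain x^2 P + sigma2 gain^2 x^2."""
    x2 = regressor(t) ** 2
    return -gain * x2 * phi, -2.0 * gain * x2 * second_moment + sigma2 * gain ** 2 * x2

def propagate(q0, gain, sigma2, dt=1e-3, t_end=5.0):
    """Runge-Kutta (RK4) integration of phi(t, 0) and P(t) = E{q(t)^2}."""
    phi, p = 1.0, q0 * q0
    for k in range(int(round(t_end / dt))):
        t = k * dt
        a1, b1 = rhs(t, phi, p, gain, sigma2)
        a2, b2 = rhs(t + dt / 2, phi + dt / 2 * a1, p + dt / 2 * b1, gain, sigma2)
        a3, b3 = rhs(t + dt / 2, phi + dt / 2 * a2, p + dt / 2 * b2, gain, sigma2)
        a4, b4 = rhs(t + dt, phi + dt * a3, p + dt * b3, gain, sigma2)
        phi += dt / 6 * (a1 + 2 * a2 + 2 * a3 + a4)
        p += dt / 6 * (b1 + 2 * b2 + 2 * b3 + b4)
    return phi, p

# Illustrative values only.
q0, gain, sigma2 = 1.5, 2.0, 0.3
phi, second_moment = propagate(q0, gain, sigma2)
closed_form = phi ** 2 * q0 ** 2 + 0.5 * sigma2 * (gain - phi * gain * phi)
steady_state = sigma2 * gain / 2.0
print(second_moment, closed_form, steady_state)
```

Doubling `gain` in this sketch speeds up the decay of `phi` but also doubles `steady_state`, which is the compromise described above.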

We shall now attempt a similar analysis for the acceleration tuner (5) applied to (1), which results in the differential equation

$$\ddot{q} = -M_1 x x^T q - 2 M_1 (M_2 + x x^T M_1 M_3) \dot{q} + M_1 x w.$$

Let

$$\begin{bmatrix} \dot{\phi}_{11} & \dot{\phi}_{12} \\ \dot{\phi}_{21} & \dot{\phi}_{22} \end{bmatrix} = \begin{bmatrix} 0 & I \\ -M_1 x x^T & -2 M_1 (M_2 + x x^T M_1 M_3) \end{bmatrix} \begin{bmatrix} \phi_{11} & \phi_{12} \\ \phi_{21} & \phi_{22} \end{bmatrix},$$

where $\phi_{11}(t,t) = \phi_{22}(t,t) = I$, $\phi_{12}(t,t) = \phi_{21}(t,t) = 0$, each $\phi_{ij}$ is a function of $(t, \tau)$ unless otherwise noted, and the dot signifies derivative with respect to the first argument. If $\dot{q}(0) = 0$,

$$q(t) = \phi_{11}(t,0) q(0) + \int_0^t d\tau \, \phi_{12}(t,\tau) M_1 x(\tau) w(\tau).$$

Following the same reasoning used for the velocity tuner, one concludes that $E_w\{q(t)\} = \phi_{11}(t,0) q(0)$ and that

$$E_w\{q q^T(t)\} = \phi_{11}(t,0) \, q q^T(0) \, \phi_{11}^T(t,0) + \sigma_w^2 \int_0^t d\tau \, \phi_{12}(t,\tau) M_1 x x^T(\tau) M_1 \phi_{12}^T(t,\tau), \qquad (6)$$

however, the properties of the acceleration and velocity tuners are not yet directly comparable because the right-hand side of (6) does not lend itself to immediate integration. To obtain comparable results, we employ the ungainly but easily verifiable formula (7), valid for arbitrary scalars $\mu_1$ and $\mu_2$, and make the

Simplifying Assumption: For $i = 1, 2,$ and $3$, $M_i = m_i I$, where $m_i$ are scalars and $I$ is the $n \times n$ identity matrix.

Premultiplying (7) by $[I \;\; 0]$, postmultiplying by $[I \;\; 0]^T$, integrating from $0$ to $t$, and using the simplifying assumption gives formula (8). Taking $\mu_1 = \mu_2 = 0$ in (8), $\bar{M}_2$ results positive-semidefinite, therefore

$$\int_0^t d\tau \, \phi_{12}(t,\tau) M_1 x x^T(\tau) M_1 \phi_{12}^T(t,\tau) \le \frac{1}{4 m_1^2 m_2 m_3 - 1} \left( m_1^2 m_3 I - \begin{bmatrix} \phi_{11}(t,0) & \phi_{12}(t,0) \end{bmatrix} \bar{M}_1 \begin{bmatrix} \phi_{11}^T(t,0) \\ \phi_{12}^T(t,0) \end{bmatrix} \right). \qquad (9)$$



The combination of (6) and (9) shows that $m_1$ can be increased without affecting $q$'s steady-state covariance: once the transition-matrix terms have decayed, the bound on the covariance is $\sigma_w^2 m_1^2 m_3 / (4 m_1^2 m_2 m_3 - 1)$, which approaches $\sigma_w^2 / (4 m_2)$ as $m_1$ grows. On the other hand, to decrease the covariance we need to increase $m_2$, which roughly speaking means increasing damping in (5). Since $m_1$ and $m_2$ can be increased without affecting the stability properties shown in §2, a better compromise between transient and steady-state performance might be achievable with the acceleration tuner than with the velocity tuner, at least in the case when $M_1$, $M_2$, and $M_3$ are "scalars." Notice that $4 m_1^2 m_2 m_3 > 1$ by construction.
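To make the dependence on the gains concrete, the sketch below evaluates the steady-state bound obtained from (6) and (9) under the simplifying assumption, namely $\sigma_w^2 m_1^2 m_3 / (4 m_1^2 m_2 m_3 - 1)$, and sets it beside the velocity tuner's steady-state value $\sigma_w^2 m / 2$. It assumes that the transition-matrix terms in (9) have decayed to zero, and the numerical gain values are hypothetical; the function names are introduced here only for illustration.

```python
def accel_steady_state_bound(m1, m2, m3, sigma2=1.0):
    """Steady-state upper bound from (6) and (9) with scalar gains M_i = m_i * I.

    Assumes the terms involving phi_11(t, 0) and phi_12(t, 0) have vanished.
    """
    denominator = 4.0 * m1 ** 2 * m2 * m3 - 1.0
    if denominator <= 0.0:
        raise ValueError("the gains must satisfy 4 * m1**2 * m2 * m3 > 1")
    return sigma2 * m1 ** 2 * m3 / denominator

def velocity_steady_state(m, sigma2=1.0):
    """Steady-state covariance sigma2 * m / 2 of the velocity tuner with M = m * I."""
    return sigma2 * m / 2.0

# Raising m1 barely moves the bound; raising m2 lowers it.
bounds_vs_m1 = [accel_steady_state_bound(m1, 1.0, 1.0) for m1 in (1.0, 10.0, 100.0)]
bounds_vs_m2 = [accel_steady_state_bound(10.0, m2, 1.0) for m2 in (1.0, 4.0, 16.0)]
velocity_values = [velocity_steady_state(m) for m in (1.0, 10.0, 100.0)]
print(bounds_vs_m1, bounds_vs_m2, velocity_values)
```

The comparison is between an upper bound and an exact steady-state value, so it indicates a trend rather than a strict ranking of the two tuners.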

Approximate Analysis: The derivation of inequality (9) does not involve any approximations, and therefore provides an upper bound on $E_w\{q q^T(t)\}$, valid independently of $x(\cdot)$. A less conservative estimate of the integral in (6) can be obtained by replacing $x x^T$ by its average value $R$ in the definition of $\bar{M}_2$ in (8). This approximation seems reasonable because $\bar{M}_2$ appears inside an integral, but calls for more extensive simulation studies.

To obtain a useful inequality, we require $\bar{M}_2 \ge 0$; namely, using the Schur complement

$$4 (1 + \mu_2) M_1 M_2 M_1 M_2 M_1 \ge M_1 (\mu_1 x x^T - (2 + \mu_2) M_2) M_1 (\mu_1 x x^T - (2 + \mu_2) M_2) M_1,$$

or, using the simplifying assumption and substituting $x x^T$ by its approximation $R$,

$$4 (1 + \mu_2) I \ge \left( (2 + \mu_2) I - \frac{\mu_1}{m_2} R \right)^2.$$

Suppose further that $R = rI$. Looking for the least conservative estimate, we pick $\mu_1 = \frac{m_2}{r} (2 + \mu_2 - 2\sqrt{1 + \mu_2})$, the least value of $\mu_1$ that keeps $\bar{M}_2 \ge 0$. Thus

$$\int_0^t d\tau \, \phi_{12}(t,\tau) M_1 x x^T(\tau) M_1 \phi_{12}^T(t,\tau) \le \frac{m_1^2 m_3 r + m_1 m_2 (\mu_2 + 2 - 2\sqrt{1 + \mu_2})}{4 m_1^2 m_2 m_3 r (1 + \mu_2) - r} I - H(t), \qquad (10)$$

with

$$H(t) = \frac{r \begin{bmatrix} \phi_{11}(t,0) & \phi_{12}(t,0) \end{bmatrix} \bar{M}_1 \begin{bmatrix} \phi_{11}^T(t,0) \\ \phi_{12}^T(t,0) \end{bmatrix}}{4 m_1^2 m_2 m_3 r (1 + \mu_2) - r}.$$



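Taking $\mu_2 = 0$ we repeat the previous, exact result. For large positive values of $\mu_2$ the first term of the right-hand side of (10) tends to $I / (4 m_1 m_3 r)$, which indicates that the steady-state covariance of the parameter error decreases when the signal $x$ increases in magnitude, and that it can be made smaller via appropriate choices of the gains $m_1$ and $m_3$. The situation for the accelerating tuner is hence much more favorable than for the conventional one.

With $\Phi = \begin{bmatrix} \phi_{11} & \phi_{12} \\ \phi_{21} & \phi_{22} \end{bmatrix}$ evaluated at $(t, \tau)$, formulas (7) and (8) read

$$\frac{\partial}{\partial \tau} \left( \Phi \bar{M}_1 \Phi^T \right) = \Phi \begin{bmatrix} 0 & 0 \\ 0 & -M_1 x x^T M_1 + 2 (1 + \mu_2) (M_1 M_2 M_1 M_3 M_1 x x^T M_1 + M_1 x x^T M_1 M_3 M_1 M_2 M_1) \end{bmatrix} \Phi^T + \Phi \bar{M}_2 \Phi^T, \qquad (7)$$

where

$$\bar{M}_1 = \begin{bmatrix} M_1 M_3 M_1 + \mu_1 M_1 & -M_1/2 \\ -M_1/2 & (1 + \mu_2) M_1 M_2 M_1 \end{bmatrix}, \qquad \bar{M}_2 = \begin{bmatrix} M_1 & \mu_1 M_1 x x^T M_1 - (2 + \mu_2) M_1 M_2 M_1 \\ \mu_1 M_1 x x^T M_1 - (2 + \mu_2) M_1 M_2 M_1 & 4 (1 + \mu_2) M_1 M_2 M_1 M_2 M_1 \end{bmatrix},$$

and

$$\int_0^t d\tau \, (4 (1 + \mu_2) m_1^2 m_2 m_3 - 1) \, \phi_{12}(t,\tau) M_1 x x^T(\tau) M_1 \phi_{12}^T(t,\tau) = \left[ \begin{bmatrix} \phi_{11}(t,\tau) & \phi_{12}(t,\tau) \end{bmatrix} \bar{M}_1 \begin{bmatrix} \phi_{11}^T(t,\tau) \\ \phi_{12}^T(t,\tau) \end{bmatrix} \right]_{\tau=0}^{\tau=t} - \int_0^t d\tau \begin{bmatrix} \phi_{11}(t,\tau) & \phi_{12}(t,\tau) \end{bmatrix} \bar{M}_2 \begin{bmatrix} \phi_{11}^T(t,\tau) \\ \phi_{12}^T(t,\tau) \end{bmatrix}. \qquad (8)$$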
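The limiting behavior of the first term on the right-hand side of (10) can be checked with a few lines of arithmetic. The sketch below evaluates that term for scalar gains and $R = rI$; the function name and the numerical values of $m_1$, $m_2$, $m_3$, and $r$ are hypothetical and serve only as an illustration.

```python
import math

def first_term_of_bound(m1, m2, m3, r, mu2):
    """Scalar coefficient of I in the first term on the right-hand side of (10)."""
    numerator = m1 ** 2 * m3 * r + m1 * m2 * (mu2 + 2.0 - 2.0 * math.sqrt(1.0 + mu2))
    denominator = 4.0 * m1 ** 2 * m2 * m3 * r * (1.0 + mu2) - r
    return numerator / denominator

# Illustrative values only.
m1, m2, m3, r = 1.0, 1.0, 1.0, 2.0
exact_case = first_term_of_bound(m1, m2, m3, r, 0.0)   # mu2 = 0 recovers the exact bound (9)
large_mu2 = first_term_of_bound(m1, m2, m3, r, 1e12)   # approaches 1 / (4 m1 m3 r)
limit = 1.0 / (4.0 * m1 * m3 * r)
stronger_signal = first_term_of_bound(m1, m2, m3, 2.0 * r, 1e12)
print(exact_case, large_mu2, limit, stronger_signal)
```

With these values the coefficient falls from $1/3$ at $\mu_2 = 0$ toward $1/8$, and doubling $r$ roughly halves the limiting value, in line with the remark that a larger regressor reduces the steady-state covariance.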



4 Simulations 

The simulations in this section compare the behavior of the accelerating tuner (5) with those of the gradient tuner (2) and of a normalized gradient one. All simulations were done in open-loop, with the regressor a two-dimensional signal, and without measurement noise. Figure 1 shows the values of $\sqrt{q^T q}$ and $x^T q$ respectively when $x$ is a two-dimensional step signal. In Figure 2 the regressor is a sinusoid, in Figure 3 an exponentially increasing sinusoid, and in Figure 4 a pseudorandom signal generated using MATLAB. No effort was made to optimize the choice of gain matrices ($M_1$, $M_2$, and $M_3$ were all chosen equal to the identity). The performance of the accelerating tuner is comparable, and sometimes superior, to that of the other tuners.



5 Concluding Remarks 

Other ideas related to the present one are replacing the integrator in (2) with a positive-real transfer function jl|, page 89], and using high-order tuning (|^, Q]). High-order tuning generates as outputs
p as well as its derivatives up to a given order (in this 
sense we might consider the present algorithm a second- 



order tuner), but unlike the accelerating tuner requires 
derivatives of x up to that same order. We expect 
that accelerating tuners will find application in adap- 
tive control of nonlinear systems and maybe in dealing 
with the topological incompatibility known as the "loss 
of stabilizability problem" in the adaptive control lit- 
erature. 

The stochastic analysis in §3 indicates that the performance and convergence properties of the accelerating tuner, together with its moderate computational complexity, may indeed make it a desirable tool for adaptive filtering applications. It seems that a better compromise between transient and steady-state performance is achievable with the accelerating tuner than with the velocity tuner. To verify this conjecture, a study of convergence properties of the accelerating tuner and their relation with the persistence of excitation conditions is in order, as well as more extensive simulations in the presence of measurement noise.
Figure 2: Response to sinusoidal data

Figure 3: Response to exponentially increasing sinusoid

Figure 4: Response to pseudorandom signal



